CDS

Accession Number TCMCG041C29287
gbkey CDS
Protein Id XP_010277074.1
Location complement(join(363159..363281,365209..365452,367502..367737,367965..368155,368331..368568,369301..369532,370784..370812))
Gene LOC104611633
GeneID 104611633
Organism Nelumbo nucifera

Protein

Length 430aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA264089
db_source XM_010278772.1
Definition PREDICTED: uncharacterized protein LOC104611633 isoform X3 [Nelumbo nucifera]

EGGNOG-MAPPER Annotation

COG_category C
Description formamidase
KEGG_TC -
KEGG_Module -
KEGG_Reaction R00524        [VIEW IN KEGG]
KEGG_rclass RC02432        [VIEW IN KEGG]
RC02810        [VIEW IN KEGG]
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
KEGG_ko ko:K01455        [VIEW IN KEGG]
EC 3.5.1.49        [VIEW IN KEGG]        [VIEW IN INGREDIENT]
KEGG_Pathway ko00460        [VIEW IN KEGG]
ko00630        [VIEW IN KEGG]
ko00910        [VIEW IN KEGG]
ko01200        [VIEW IN KEGG]
map00460        [VIEW IN KEGG]
map00630        [VIEW IN KEGG]
map00910        [VIEW IN KEGG]
map01200        [VIEW IN KEGG]
GOs -

Sequence

CDS:  
ATGACATTAAACATGCTGATCTTTCAATTTACTGGTTCAGCAATGCCTCCTCCCACTCCAAGAATACTAGTCCCCATAGACCTCAGGAAGAAGCCATGGGAGCAGAAGCTACCACTCCATAACCGTTGGCACCCACTGATACCCGCAGTTGCTGATGTTAGAGTAGGTGATATCTTCAGAATTGAGACGCTAGACTGGACTGGAGGTGCCATCACAGATAATGCATCCGCAATAGATGTGAAAACCATTGATCTCTCAACTGTGCATTATCTCAGCGGTCCCATAAAAGTTTCAGATAAAGATGGCATTCCAGCCAAGCCAGGTGATCTTCTTGCGGTTGAAATCTGTAATTTGGGCCCTCTTACTGGAGATGAATGGGGTTATACAGCAACATTTGACAGAGAAAATGGAGGTGGTTTTCTGACAGATCATTTCCCTTGTGCAACCAAAGCAATTTGGTATTTTGAGGGAATATATGCTTACTCTCCTCAGATACCAGGTGTACGATTTCCAGGTTTAACTCACCCTGGAATAATTGGAACTGCTCCATCAATGGAACTCCTAAGCATATGGAATGAAAGAGAGAAACAACTAGAAGAAAATGGTCCCCAATCTCTGAAGTTGTGTGAGGTCCTGCACTCACGACCACTGGCAAACCTACCAACATCCAAAGGTTGTCTTCTTGGAAAGATCCAAGAAGGAACTCCAGAATGGGAAAAGATTGCAAGAGAGGCTGCACGGACAATCCCAGGAAGAGAAAATGGAGGAAATTGTGACATCAAGAATCTCAGTAAGGGTTCAAAGATATATCTTCCAGTATTTGTAGATGGAGCAAATTTCAGTACGGGTGACATGCACTTCTCCCAGGGCGATGGAGAAGTCTCCTTCTGTGGAGCAATAGAGATGAGTGGATTTCTAGAGCTCAAGTGTGAAATCATAAGGGGAGGGATGAAAGAGTACCTAACACCAATGGGTCCCACTCCTCTTCATGTGAACCCAATCTTTGAAATAGGCCCAGTCGAGCCAAGGTTCTCAGAATGGCTGGTTTTTGAAGGCATCAGTGTTGATGAGAGTGGGAGGCAACATTACCTTGACGCAAGTGTTGCATACAAGCGTGCAGTGCTCAATGCCATTGACTATCTCAACAAATTTGGATACTCCAAAGAGCAGGATGTTCGCCCCAAGACCAACAAAGTACCAGTTGGGCCCCGTCTACTCAGGAAACCTGATGTTCTAAAATGCACTTACGATGGCAACTTACCCACTACAACGAACCCTGCTGGCAAAACATAG
Protein:  
MTLNMLIFQFTGSAMPPPTPRILVPIDLRKKPWEQKLPLHNRWHPLIPAVADVRVGDIFRIETLDWTGGAITDNASAIDVKTIDLSTVHYLSGPIKVSDKDGIPAKPGDLLAVEICNLGPLTGDEWGYTATFDRENGGGFLTDHFPCATKAIWYFEGIYAYSPQIPGVRFPGLTHPGIIGTAPSMELLSIWNEREKQLEENGPQSLKLCEVLHSRPLANLPTSKGCLLGKIQEGTPEWEKIAREAARTIPGRENGGNCDIKNLSKGSKIYLPVFVDGANFSTGDMHFSQGDGEVSFCGAIEMSGFLELKCEIIRGGMKEYLTPMGPTPLHVNPIFEIGPVEPRFSEWLVFEGISVDESGRQHYLDASVAYKRAVLNAIDYLNKFGYSKEQDVRPKTNKVPVGPRLLRKPDVLKCTYDGNLPTTTNPAGKT